(1) A semiconductor memory device, characterized in that it is equipped with
a first memory cell array, having a plurality of memory cells arranged in a plurality of rows
and a plurality of columns and divided into a plurality of blocks in units of a plurality of
columns;

a row selection means, which is used to select respective rows of said plurality of 
memory cells; 

a column selection means, which is used to select respective columns of said plurality of 
memory cells; 

a block selection signal input means, which inputs a block selection signal;

a block selection means, which is used to select any block from said plurality of blocks in 
said first memory cell array, in response to said block selection signal; 

a second memory cell array, comprising a plurality of static type memory cells, arranged 
in a plurality of rows and a plurality of columns and divided into a plurality of regions in a 
plurality of column units; 

a region selection means, which is used to select one out of said plurality of regions in 
said second memory cell array in response to said region selection signal; 

a data transmission means, transmitting data between the block in said first memory cell
array, selected with said block selection means, and the region in said second memory cell
array, selected with said region selection means;

a first selection means, which can select any information from the information items 
corresponding to a plurality of said static type memory cells, in each of said regions of said 
second memory cell array; and 

a second selection means, which can select, in response to said region selection signal,
any item among said plurality of information items selected for each of said regions by said
first selection means.

(2) The semiconductor memory device of claim 1, wherein said row selection means
selects a row in said first memory cell array in response to a row address signal;

said column selection means selects a column in response to a column address signal;

and said first selection means selects rows and columns of said second memory cell array in
response to said column address signal, so that data transmission operations are applied to a
plurality of rows in the same block of said first memory cell array and each row of said second
memory cell array.

(3) The semiconductor memory device of claim 1 or claim 2, wherein the construction of said first
memory cell array comprises memory cells of the static type, equipped with output terminals 
providing output of data from said first memory cell array, and with output terminals providing 
output of data from said second memory cell array. 

3. Detailed Explanation of the Invention 

(Sphere of Industrial Use) 

This invention relates to a semiconductor memory device for a simple cache system, in 
particular to a semiconductor memory device having a cache memory integrated on the same 
chip. 

(Prior Art Technology) 

According to the prior art, in order to improve the cost performance of a computer system, a
small-capacity high-speed memory is deployed as a high-speed buffer between the central
processing unit (CPU) and the main memory, which is built from low-speed, and thus low-cost,
DRAM. This high-speed buffer is a so-called cache memory, in which a block of data that is
highly likely to be required by the CPU is copied from the main memory and stored. The state in
which the data stored at the DRAM address to be accessed by the CPU is present in the cache
memory is referred to as a "hit"; in this case the CPU accesses the high-speed cache memory. On
the other hand, the state in which the data stored at the address to be accessed by the CPU is
not present in the cache memory is referred to as a "cache miss"; in this case the CPU accesses
the low-speed main memory and at the same time transfers to the cache memory the data block to
which this data belongs.

However, because such a cache memory system requires expensive high-speed memory, it
could not be used in compact systems, where cost is important. That is why, conventionally, a
simple cache system has been configured using the page mode or static column mode of a
general-purpose DRAM.

Figure 5 is a block diagram showing a basic construction of a conventional DRAM 
element which can be used with the page mode or with the static column mode. 

In this figure, a memory cell array 1 is provided with a plurality of word lines and a
plurality of bit lines arranged so as to intersect each other, with memory cells deployed at the
intersection points. The word lines of the memory cell array 1 are connected through a word
driver 2 to a row decoder part 3. In addition, the bit lines of the memory cell array 1 are
connected through a sense amplifier part 4 and an I/O switch part 5 to a column decoder part 6.
A row address buffer 7 is connected to the row decoder part 3, and a column address buffer 8 is
connected to the column decoder part 6. A multiplex address signal MPXA, in which the row
address signal RA and the column address signal CA are multiplexed, is applied to the row
address buffer 7 and the column address buffer 8. Furthermore, an output buffer 9 and an input
buffer 10 are connected to the I/O switch part 5.

Figure 6A, Figure 6B and Figure 6C are waveform diagrams showing the operation of the
DRAM in an ordinary read cycle, a page mode cycle and a static column mode cycle,
respectively.

In the ordinary read cycle shown in Figure 6A, the row address buffer 7 first acquires the
multiplex address signal MPXA at the falling edge of the row address strobe signal RAS and
applies it as the row address signal RA to the row decoder part 3. The row decoder part 3
selects one word line from the plurality of word lines according to this row address signal RA.
As a result, the information stored in the plurality of memory cells connected to the selected
word line is read out to the respective bit lines, and this information is detected and
amplified by the sense amplifier part 4. At this point in time, the information stored in the
memory cells of one row is latched by the sense amplifier part 4.

Next, the column address buffer 8 acquires the multiplex address signal MPXA at the
falling edge of the column address strobe signal CAS and applies it as the column address signal
CA to the column decoder part 6. The column decoder part 6 then selects one of the information
items of the one row latched by the sense amplifier part 4 in response to this column address
signal CA. The selected information is output to the outside as output data Dout through the
I/O switch part 5 and the output buffer 9. The access time in this case (RAS access time) tRAC
is the time from the falling edge of the row address strobe signal RAS until the output data
Dout becomes valid. The cycle time tc in this case is the sum of the time during which the
element is in the active state and the RAS precharge time tRP. As a standard value, tc is
approximately 200 ns when tRAC = 100 ns.

In the page mode and static column mode shown in Figure 6B and Figure 6C, memory
cells on the same row are accessed by changing the column address signal CA. In the page mode,
the column address signal CA is latched at the falling edge of the column address strobe signal
CAS. In the static column mode, access is achieved by simply changing the column address
signal CA, as in a static RAM (SRAM). The CAS access time tCAC of the page mode and the
address access time tAA of the static column mode are therefore about 1/2 of the RAS access
time tRAC, that is, about 50 ns for tRAC = 100 ns. The cycle time is also shortened in this
case; in the page mode it depends on the CAS precharge time tCP, and a value of about 50 ns is
obtained, the same as in the static column mode.

Figure 7 is a block diagram showing the construction of a simple cache system using the 
page mode or static column mode of the DRAM element shown in Figure 5. In addition, Figure 8 
is a waveform diagram explaining the operation of the simple cache system shown in Figure 7. 

As shown in Figure 7, the main memory 20 has a capacity of 1 M byte and comprises 8 DRAM
elements 21. In this case, a total of 20 bits (2^20 = 1048576 = 1 M) is required for the row
address signal RA and the column address signal CA. An address multiplexer 22, which supplies
the 10-bit row address signal RA and the 10-bit column address signal CA to the main memory 20
in two steps, has 20 address lines A0 ~ A19, which receive the 20-bit address signal, and 10
address lines A0 ~ A9, which apply the multiplexed 10-bit address signal (multiplex address
signal MPXA) to the DRAM elements 21.
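
The address multiplexing described above can be sketched as follows. This is only a minimal
illustration: the function names are hypothetical, and the assumption that the row address
occupies the upper 10 bits of the 20-bit address is not stated in the text.

```python
ROW_BITS = 10  # width of the row address signal RA, from the text
COL_BITS = 10  # width of the column address signal CA, from the text


def split_address(addr: int) -> tuple[int, int]:
    """Split a 20-bit address into (RA, CA).

    Assumption: RA is taken from the upper 10 bits and CA from the lower 10 bits.
    """
    if not 0 <= addr < (1 << (ROW_BITS + COL_BITS)):
        raise ValueError("address does not fit in 20 bits")
    ra = addr >> COL_BITS
    ca = addr & ((1 << COL_BITS) - 1)
    return ra, ca


def multiplex(addr: int) -> list[int]:
    """Return the two 10-bit values placed in turn on the shared lines (MPXA)."""
    ra, ca = split_address(addr)
    return [ra, ca]  # RA first (latched by RAS), then CA (latched by CAS)
```

Under these assumptions, address 1025 is presented to the DRAM elements as row 1 followed by
column 1.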

An address generator 23 generates the address signal corresponding to the data requested by
the CPU 24. A latch (TAG) 25 holds the row address signal RA corresponding to the data
selected in the previous cycle. A comparator 26 compares the row address signal RAL held in the
TAG 25 to the 10-bit row address signal RA out of the 20-bit address signal. When the two
coincide, the same row is being accessed as in the previous cycle (hit), and the comparator 26
generates a high-level cache hit (Cache Hit) signal CH. A state machine 27, responding to the
cache hit signal CH, performs page mode control by toggling the column address strobe signal
CAS while maintaining the row address strobe signal RAS at the low level. In response to that,
the address multiplexer 22 applies the column address signal CA to the DRAM elements 21 (see
Figure 8). Therefore, in the case of such a hit, output data is obtained at a high speed from
the DRAM elements 21 with the access time tCAC.

On the other hand, when the row address signal RA obtained from the address generator 23
does not match the row address signal RAL held in the TAG 25, a row different from the row
accessed in the previous cycle is being accessed (cache miss), and the comparator 26 does not
generate the high-level cache hit signal CH.

In this case, the state machine 27 performs the RAS and CAS control of an ordinary read
cycle, and the address multiplexer 22 sequentially applies the row address signal RA and the
column address signal CA to the DRAM elements 21 (see Figure 8). In the case of such a cache
miss, an ordinary read cycle starts from the RAS precharge, and because output data is obtained
with the low-speed access time tRAC, the state machine 27 generates the wait signal Wait and the
CPU 24 is brought into a wait state. In the case of a cache miss, the new row address signal RA
is held in the TAG 25.
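
The hit and miss behaviour of this single-entry TAG can be sketched as below. This is a rough
model under stated assumptions: the class and function names are hypothetical, the row address
is assumed to be the upper 10 bits of the address, and a miss is charged only tRAC (the RAS
precharge that precedes a miss cycle is ignored).

```python
T_CAC_NS = 50    # access time on a hit (page mode), standard value from the text
T_RAC_NS = 100   # access time on a miss (ordinary read cycle), from the text
COL_BITS = 10    # assumption: the row address is the upper 10 bits of the address


class SingleEntryTag:
    """Models TAG 25 and comparator 26 of Figure 7: one held row address."""

    def __init__(self) -> None:
        self.held_row = None  # row address RAL from the previous cycle

    def access(self, addr: int) -> bool:
        """Return True on a hit; on a miss, hold the new row address."""
        row = addr >> COL_BITS
        hit = row == self.held_row
        if not hit:
            self.held_row = row
        return hit


def average_access_ns(addresses: list[int]) -> float:
    """Average access time over a sequence of addresses (precharge ignored)."""
    tag = SingleEntryTag()
    total = sum(T_CAC_NS if tag.access(a) else T_RAC_NS for a in addresses)
    return total / len(addresses)
```

With only one entry, any access that alternates between two rows misses every time, which is
the weakness discussed in the following paragraphs.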

Therefore, in the simple cache system shown in Figure 7, the data of one row of the memory
cell array of the DRAM element (1024 bits for a 1 M bit element) forms one block, so the block
size is unnecessarily large, and the number of blocks (number of entries) held in the TAG 25 is
insufficient (1 entry in the system shown in Figure 7). This creates the problem of a low cache
hit rate.

Another conventional example of a simple cache system is disclosed in US Patent Number
4,577,293. This simple cache system has a register, provided outside the memory cell array,
which holds the data of one row, so that in the case of a hit, high-speed access is achieved by
reading data directly from this register. However, in the simple cache system disclosed in this
US patent as well, the block size is unnecessarily large, and the problem of a low cache hit
rate arises in the same manner as in the conventional example shown in Figure 5 and Figure 7.

That is why a DRAM element provided with a built-in cache memory has been proposed 
as shown in Figure 9. 

The differences between this DRAM element and the DRAM element shown in Figure 5
are as follows. Specifically, the DRAM memory cell array 1 is divided in its address space into
a plurality of blocks, each comprising a plurality of memory cells. In Figure 9, the array is
divided into 4 blocks B1 ~ B4. Also, a transfer gate part 11 and a SRAM memory cell array 12
are deployed between the sense amplifier part 4 and the I/O switch part 5. Moreover, a block
decoder 13 and a way decoder 14 are also provided. The part of the column address signal CA
that corresponds to the number of blocks is supplied from the column address buffer 8 to the
block decoder 13, and the activation of the block decoder 13 is controlled by the cache hit
signal CH. Also, a way address signal WA is applied through a way address buffer 15 to the way
decoder 14. The way decoder 14 selects a word line of the SRAM memory cell array 12 according
to the way address signal WA.

Figure 10 is a diagram showing the detailed construction in one part of the DRAM
element shown in Figure 9. 

Figure 10 shows the sense amplifier part 4, the transfer gate part 11, the SRAM memory
cell array 12, the I/O switch part 5 and the column decoder part 6. These comprise multiple
sense amplifiers 40, transfer gates 110, SRAM memory cells 120, I/O switches 50 and column
decoders 60, corresponding to the multiple bit line pairs BL, BL of the DRAM memory cell
array 1. In addition, block decoders 13 are arranged to correspond to each block of the DRAM
memory cell array 1. Each sense amplifier 40 is connected to a bit line pair BL, BL. Also, each
bit line pair BL, BL is connected to a bit line pair SBL, SBL of the SRAM memory cell array 12
through a transfer gate 110 formed by N-channel MOSFETs Q1, Q2. The bit line pairs SBL, SBL of
the SRAM memory cell array 12 are connected to the I/O band I/O, I/O through I/O switches 50
formed by N-channel MOSFETs Q3, Q4. A common transfer signal is applied for each block from
the block decoder 13 to the gates of the MOSFETs Q1, Q2 of the transfer gates 110. Also, a
column selection signal is applied from the corresponding column decoder 60 to the gates of the
MOSFETs Q3, Q4 of each I/O switch 50.

When the transfer signal is applied by the block decoder 13 to the transfer gates 110
corresponding to a block in this DRAM element, the data on the same row is transferred in block
units from the DRAM memory cell array 1 to the SRAM memory cell array 12. When one of the
word lines W1 ~ Wn of the SRAM memory cell array 12 is selected by the way decoder 14, the data
stored in the SRAM memory cells 120 connected to that word line is read out to the bit line
pairs SBL, SBL. The data read out to the bit line pairs SBL, SBL is then read out to the I/O
band I/O, I/O when the column decoder 60 applies the column selection signal to the I/O
switch 50.

In this DRAM element, one data block consists of the data of a plurality of columns on one
row, so data blocks on a plurality of different rows can be held in the SRAM memory cells 120,
and data blocks on different rows in the same columns can be held in the SRAM memory cell
array 12 at the same time (associativity).

Accordingly, when this SRAM memory cell array is used as a cache memory, the number of
data entries can be increased, and as a result the cache hit rate can be increased.

Furthermore, when the word lines W1 ~ Wn of the SRAM memory cell array 12 are
maintained in the non-active state, a configuration can be created in which no transfer to the
cache memory is carried out, not only during writing to the DRAM memory cell array 1 but also
during reading from the DRAM memory cell array 1. This has the advantage of increasing the
freedom available in applying the cache system.

Figure 11 is a block diagram showing the construction of a simple cache system using the
DRAM element shown in Figure 9. 

In Figure 11, the main memory 30 has a 1 M byte configuration comprising 8 DRAM
elements 31 of 1 M x 1 construction. The differences between the memory system shown in
Figure 11 and the memory system shown in Figure 7 are that the TAG 25 and the comparator 26
are enlarged in correspondence with the number of word lines of the SRAM memory cell array 12
and the number of block divisions (number of sets) of the DRAM element 31, and that the cache
hit signal CH and the way address signal WA output from the comparator 26 are input to the
DRAM elements 31. Here, the way address signal WA has 2 bits.

The operation of the simple cache system shown in Figure 11 will now be explained with
reference to Figures 6A to 6C, which were used to explain the conventional simple cache system,
and the operation waveform diagram of Figure 12.

The TAG 25 holds, for each block, multiple row addresses corresponding to the rows
selected most recently, as address sets for the cache. Because the way address signal WA is a
2-bit signal in this case, 4 row addresses are held per block. Accordingly, with 4 blocks, 16
address sets are stored in the TAG 25. In addition, sets of addresses that are often used can
also be held in a fixed manner in the TAG 25.

First, the address signal corresponding to the data requested by the CPU 24 is generated
by the address generator 23. The comparator 26 compares the 10-bit row address signal RA out of
the 20-bit address signal, together with the multiple bits of the column address signal CA
corresponding to the block division (2 bits in the example shown in Figure 9), to the address
sets held in the TAG 25. If they coincide, a cache hit occurs, and the comparator 26 issues a
high-level cache hit signal CH and the way address signal WA of the hit block. In response to
this cache hit signal CH, the state machine 27 toggles the column address strobe signal CAS
while maintaining the row address strobe signal RAS at the low level. In response to that, the
address multiplexer 22 applies the 10-bit column address signal CA to the DRAM elements 31
(see Figure 12). At this time, in the DRAM element 31, the cache hit signal CH prevents the
column address signal CA from being supplied to the block decoder 13, as shown in Figure 9.

Accordingly, the DRAM memory cell array 1 and the SRAM memory cell array 12 are kept
separated from each other. Also, data is read out to each of the bit line pairs SBL, SBL
according to the way address signal WA. Further, the I/O switch 50 is made conductive by the
column decoder 60 in response to the column address signal CA. As a result, the data held in
the SRAM memory cell 120 addressed by the column address signal CA and the way address signal
WA is output through the I/O band I/O, I/O and the output buffer 9. In the case of such a hit,
output data can be obtained at a high speed with the access time tCAC, as in the page mode.

On the other hand, when the address signal generated by the address generator 23 does
not coincide with any address set held in the TAG 25, a cache miss occurs, and the comparator
26 does not generate the high-level cache hit signal CH.

In this case, the state machine 27 performs the RAS and CAS control of an ordinary read
cycle, and the address multiplexer 22 sequentially supplies the row address signal RA and the
column address signal CA to the DRAM elements 31 (see Figure 12). Because in the case of such
a cache miss output data is obtained with the low-speed access time tRAC, the state machine 27
generates the wait signal Wait and the CPU 24 is brought into a wait state. In the case of a
cache miss, the data of the block containing the memory cells accessed at this time is
transferred in one batch, through the transfer gates 110 made conductive by the block decoder
13, from the bit line pairs BL, BL of the DRAM memory cell array 1 to the block of SRAM memory
cells 120 selected by the way address signal WA. The content stored in this block of SRAM
memory cells 120 is rewritten in this manner. Also, the new address set for the way of the
corresponding block indicated by the way address signal WA is held in the TAG 25.

Therefore, in the simple cache system which uses the DRAM element shown in Figure 9, the
data of a plurality of blocks is held in the SRAM memory cell array 12 used as a cache memory,
so the number of data entries held in the TAG 25 can be increased, and the cache hit rate is
increased accordingly.

Also, although in the example described above data is transferred to the SRAM memory cell
array serving as the cache memory when the DRAM memory cell array is accessed on a cache miss,
this transfer can also be prohibited by placing all the word lines of the SRAM memory cell
array in the non-selected state. Likewise, when writing operations are applied to the DRAM
memory cell array, there is an option either to carry out or not to carry out the transfer to
the SRAM memory cell array. Furthermore, the example shown in Figure 11 corresponds to a 4-way
set-associative cache.
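
The TAG organization described above (4 blocks, each with 4 ways, giving 16 address sets) can
be sketched as follows. This is only an illustrative model: the class and method names are
hypothetical, the block is assumed to be selected by the upper 2 bits of the 10-bit column
address, and the round-robin choice of the way to overwrite on a miss is an assumption, since
the text does not state a replacement policy.

```python
N_BLOCKS = 4  # block divisions of the DRAM memory cell array (2 bits of CA)
N_WAYS = 4    # ways addressed by the 2-bit way address signal WA


class Tag:
    """Models TAG 25 and comparator 26 of Figure 11."""

    def __init__(self) -> None:
        # rows[block][way] holds a row address, or None while the entry is empty.
        self.rows = [[None] * N_WAYS for _ in range(N_BLOCKS)]
        self.next_way = [0] * N_BLOCKS  # assumption: round-robin replacement

    @staticmethod
    def block_of(col: int) -> int:
        """Assumption: the upper 2 bits of the 10-bit column address select the block."""
        return col >> 8

    def lookup(self, row: int, col: int):
        """Return (CH, WA): hit flag and way address, or (False, None) on a miss."""
        for way, held in enumerate(self.rows[self.block_of(col)]):
            if held == row:
                return True, way
        return False, None

    def fill(self, row: int, col: int) -> int:
        """On a miss, hold the new row address and return the way that was overwritten."""
        block = self.block_of(col)
        way = self.next_way[block]
        self.rows[block][way] = row
        self.next_way[block] = (way + 1) % N_WAYS
        return way
```

In this model two different rows of the same block can be resident at once, which the
single-entry TAG of Figure 7 cannot do.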

(Problems to Be Solved By This Invention) 

However, in the simple cache system described above, when a cache hit occurs, the way
address signal WA used to access the SRAM memory cell array 12 serving as the cache memory is
output only after the comparison has been carried out by the comparator 26. Because the way
address signal WA is therefore supplied late to the DRAM element 31, the driving of the word
lines of the SRAM memory cell array 12 is delayed, and even though the device has a high-speed
SRAM memory cell array 12 as a cache memory, it has the disadvantage that high-speed access
cannot be achieved during a hit.

In order to solve the problems mentioned above, the purpose of this invention is to
provide a semiconductor memory device having a built-in cache memory, which makes it
possible to configure a cache system in which high-speed access can be achieved during a hit.

(Means to Solve Problems) 

The semiconductor memory device of this invention is equipped with a first memory cell
array, a row selection means, a column selection means, a block selection signal input means, a
block selection means, a second memory cell array, a region selection signal input means, a
region selection means, a data transmission means, a first selection means, and a second
selection means.

The first memory cell array comprises a plurality of memory cells arranged in multiple
rows and columns and is divided into a plurality of blocks in units of a plurality of columns.
The row selection means is used to select each row of the memory cells. The column selection
means is used to select each column of the memory cells. The block selection signal input
means is used to input a block selection signal. The block selection means is used to select
any one of the plurality of blocks in the first memory cell array in response to the block
selection signal.

In addition, the second memory cell array, which comprises multiple static memory cells
arranged in multiple rows and multiple columns, is divided into multiple regions in units of
multiple columns. The region selection means is used to select any one of the multiple
regions of the second memory cell array in response to the region selection signal.

The data transmission means is used to transmit data between the block of the first
memory cell array, selected with the block selection means, and the region of the second
memory cell array, selected with the region selection means.

In addition, the first selection means selects, in each region of the second memory cell
array, any one of the information items corresponding to the plurality of static memory cells.
The second selection means selects, in response to the region selection signal, any one of the
multiple information items selected for each region by the first selection means.

(Operation) 

In the semiconductor memory device with a built-in cache memory according to this
invention, data blocks on multiple rows of the first memory cell array can be held in the
second memory cell array, and multiple data blocks on different rows in the same columns of
the first memory cell array can be held simultaneously in different regions of the second
memory cell array.

Also, the data blocks on different rows in the same columns of the first memory cell array
can be arranged in the same row of the second memory cell array. Therefore, by using this
second memory cell array as a cache memory, the number of data entries can be effectively
increased, which not only makes it possible to increase the cache hit rate, but also enables
high-speed access to the cache memory.

(Embodiment) 

The following is an explanation of one embodiment of this invention with reference to the
enclosed figures.

Figure 1 is a block diagram which shows the construction of a DRAM element according 
to one embodiment of this invention. 

As this embodiment is the same as the DRAM element shown in Figure 9 with the 
exception of the points described below and the same reference symbols are applied to the 
corresponding parts, an explanation thereof will be omitted when appropriate. 

As shown in the figure, the DRAM memory cell array 1 is divided into multiple blocks in
its address space. In this embodiment, it is divided into 4 blocks BK1 ~ BK4. Moreover, the
SRAM memory cell array 12 is divided into multiple ways in units of multiple columns. However,
a different number of blocks in the DRAM memory cell array 1 and a different number of ways in
the SRAM memory cell array 12 may also be used.

Between the DRAM memory cell array 1 and the SRAM memory cell array 12 are
arranged a sense amplifier part 4, a block transfer gate part 11, an internal I/O band 41, and
a way transfer gate part 42. A block decoder 13, responding to one part of the column address
signal CA (2 bits in the case of this embodiment), instructs the block transfer gate part 11
whether block data of the DRAM memory cell array 1 is to be transferred and, if so, which
block. The way transfer gate part 42 transfers the data placed on the internal I/O band 41 to
one of the ways of the SRAM memory cell array 12. The way decoder 14 instructs the way transfer
gate part 42 whether the data on the internal I/O band 41 is to be transferred and, if so, to
which way, in response to the way address signal WA applied through the way address buffer 15.
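
The two-stage transfer path described above (block transfer gate part, internal I/O band, way
transfer gate part) can be sketched as follows. This is a simplified model under stated
assumptions: the function names, the number of ways, the small number of columns per block,
and the `cache_row` index used to pick the SRAM row being written are all hypothetical.

```python
N_BLOCKS = 4  # blocks BK1 ~ BK4 of this embodiment
N_WAYS = 4    # assumption: four ways; the text allows other numbers
N_COLS = 8    # columns per block; a small hypothetical value for illustration


def block_transfer(dram_row: list[int], block: int) -> list[int]:
    """Block transfer gate part 11: place one block of the sensed row on the internal I/O band."""
    start = block * N_COLS
    return dram_row[start:start + N_COLS]


def way_transfer(io_band: list[int], sram: list[list[list[int]]],
                 way: int, cache_row: int) -> None:
    """Way transfer gate part 42: copy the internal I/O band into one way of one SRAM row."""
    sram[way][cache_row] = list(io_band)


# Usage: move block index 1 of a sensed DRAM row into way index 2, SRAM row 0.
dram_row = list(range(N_BLOCKS * N_COLS))
sram = [[[0] * N_COLS for _ in range(2)] for _ in range(N_WAYS)]
way_transfer(block_transfer(dram_row, 1), sram, way=2, cache_row=0)
```

Because the internal I/O band is shared, any block can be routed to any way, which is what
allows blocks from different DRAM rows to sit side by side in the SRAM memory cell array.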

The SRAM memory cell array 12 is provided with a cache row decoder 43, a cache I/O
switch part 44, and a cache column decoder part 45. The cache row decoder 43 selects 1 row of
the SRAM memory cell array 12 in response to the cache row address signal supplied from the
cache address buffer 46. The cache column decoder part 45 selects 1 column in each of the ways
in response to the cache column address signal supplied from the cache address buffer 46. The
cache address buffer 46 receives, as the cache address signal CCA, the column address signal CA
applied to the DRAM memory cell array 1. One part thereof is applied as the cache row address
signal to the cache row decoder 43, and the other part is applied as the cache column address
signal to the cache column decoder part 45. Multiple SRAM sense amplifiers 47, corresponding to
the respective ways of the SRAM memory cell array 12, are connected to the cache I/O switch
part 44 through respective I/O line pairs I/OA ~ I/OD.

The data in the SRAM memory cell array 12, selected in each way by the cache row
decoder 43 and the cache column decoder part 45, is detected and amplified by the
corresponding SRAM sense amplifier 47. A way selector 48 selects one data item from the data
supplied by the plurality of SRAM sense amplifiers 47, in response to the way address signal
WA applied from the way address buffer 15, and outputs it to the outside through an output
buffer 9b as cache output data Dout. When data applied to the input buffer 10b as cache input
data Din is written to one of the memory cells of the SRAM memory cell array 12, the path
described above is used in the opposite direction.

Figure 1 shows the state in which the data A1, B1, C1 and D1 on respective rows of the
block BK1 of the DRAM memory cell array 1 has been transferred to the same row of the ways A,
B, C and D of the SRAM memory cell array 12.

Figure 2 is a diagram showing a detailed construction of one part of the configuration
shown in Figure 1.

In each of the blocks BK1 ~ BK4 of the DRAM memory cell array 1, the sense amplifier
part 4 and the block transfer gate part 11 comprise n sense amplifiers 40 and n block transfer
gates 110, respectively, corresponding to n bit line pairs BL1 ~ BLn. Also, the internal I/O
band 41 comprises n I/O line pairs I/O1 ~ I/On. The bit line pairs BL1 ~ BLn in each block are
connected to the respective corresponding I/O line pairs I/O1 ~ I/On through the sense
amplifiers 40 and the block transfer gates 110.

On the other hand, the SRAM memory cell array 12 is divided into 4 ways, and each way
comprises SRAM memory cells 120 in n columns, that is to say n bit line pairs SBL1 ~ SBLn.
For each of the ways, the way transfer gate part 42 comprises n way transfer gates 420
corresponding to the n bit line pairs SBL1 ~ SBLn. In each of the ways, the n bit line pairs
SBL1 ~ SBLn are connected through the way transfer gates 420 to the corresponding I/O line
pairs I/O1 ~ I/On. The cache I/O switch part 44 comprises cache I/O switches 440 corresponding
to the respective bit line pairs SBL1 ~ SBLn of the SRAM memory cell array 12, and 4 I/O line
pairs I/OA ~ I/OD corresponding to the respective ways. The n bit line pairs SBL1 ~ SBLn
belonging to each of the ways are connected through the cache I/O switches 440 to the I/O line
pair corresponding to that way. For example, the bit line pairs SBL1 ~ SBLn belonging to the
way C are all connected to the I/O line pair I/OC.

In addition, a cache column decoder part 45 is deployed for each of the ways. The cache
column decoder part 45 of each of the ways comprises cache column decoders 450
corresponding to each column. Each of the cache column decoders 450 is connected to the gates
of the MOS transistors of the corresponding cache I/O switch 440.

Figure 3 is a block diagram showing a simple cache system using the DRAM element of 
Figure 1. 

As shown in Figure 3, the main memory 30 has a 1 M byte configuration comprising 8 DRAM
elements 31 of 1 M x 1 construction. Unlike the memory system shown in Figure 11, in the
memory system shown in Figure 3 the 10-bit address signal corresponding to the column address
signal multiplexed by the address multiplexer 22 is input to the DRAM elements 31 as the cache
address signal CCA, in place of the cache hit signal CH output from the comparator 26. Another
difference is that a data select signal DS, generated by the state machine 27 in response to
the cache hit signal CH, is input to a data selector 51. The data selector 51 selects and
outputs either the DRAM data DD or the cache data CD supplied from the DRAM elements 31, in
response to the data select signal DS.

The operation of the simple cache system shown in Figure 3 will now be explained with
reference to the operation waveform diagram shown in Figure 4.

The TAG 25 holds, for each individual block, a plurality of cache address sets made up of the 
row addresses corresponding to the rows selected in the most recent cycles. In this case, 
because the way address signal WA is a 2-bit signal, 4 sets of row addresses are 
held per block. Therefore, when 4 blocks are used, 16 address sets are stored in the TAG 25. In addition, 
addresses that are often used are held in a fixed manner in the TAG 25. 
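
The count of address sets above follows directly from the width of the way address signal and the number of blocks. As a minimal sketch of that arithmetic (the variable names are illustrative and do not appear in the specification):

```python
# Figures taken from the text: WA is a 2-bit signal and 4 blocks are used.
WAY_ADDRESS_BITS = 2
NUM_BLOCKS = 4

# A 2-bit way address distinguishes 2**2 ways, so 4 row addresses per block.
num_ways = 2 ** WAY_ADDRESS_BITS

# One address set is held per (block, way) pair in the TAG.
tag_entries = num_ways * NUM_BLOCKS

print(num_ways, tag_entries)
```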

First, an address signal corresponding to the data requested by the CPU 24 is 
generated by the address generator 23. 

The comparator 26 compares the address sets held in the TAG 25 with the row address signal RA, 
which is 10 bits out of the 20-bit address signal, and with multiple bits (2 bits in 
the example indicated in Figure 3) of the column address signal CA that correspond to the block 
segment. If the two coincide, a cache hit occurs, and the comparator 26 
generates the cache hit signal CH at the high level together with the way address signal WA for the 
block that was hit. 
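
The comparison can be sketched in software as follows. This is only an illustrative model, not the circuit of the specification: it assumes that RA occupies the upper 10 bits of the 20-bit address and that the 2 block bits are the upper bits of CA, neither of which is stated in the text, and the function names are hypothetical.

```python
ROW_BITS = 10    # width of RA (from the text)
COL_BITS = 10    # width of CA: 20-bit address minus the 10 row bits
BLOCK_BITS = 2   # block-select bits taken from CA (from the text)

def split_address(addr):
    """Split a 20-bit address into (row, column, block).

    Assumption: RA is the upper half of the address and the block bits
    are the top bits of CA.
    """
    row = (addr >> COL_BITS) & ((1 << ROW_BITS) - 1)
    col = addr & ((1 << COL_BITS) - 1)
    block = col >> (COL_BITS - BLOCK_BITS)
    return row, col, block

def compare(tag, addr):
    """Model of the comparator: return (CH, WA).

    tag[block][way] holds the row address cached for that block and way,
    or None when the entry is empty.
    """
    row, _col, block = split_address(addr)
    for way, held_row in enumerate(tag[block]):
        if held_row == row:
            return True, way      # cache hit: CH high, WA = hit way
    return False, None            # cache miss: CH not generated

# 4 blocks x 4 ways, with one entry filled in for the example.
tag = [[None] * 4 for _ in range(4)]
tag[2][1] = 0x155
hit_addr = (0x155 << COL_BITS) | (0b10 << 8) | 0x3C
miss_addr = (0x156 << COL_BITS) | (0b10 << 8) | 0x3C
print(compare(tag, hit_addr), compare(tag, miss_addr))
```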

Before the comparison of the address signal is carried out by the comparator 26, the 
10-bit cache address signal CCA is input to the DRAM elements 31 on the assumption that a cache 
hit will occur, so that the SRAM cell read operation proceeds in advance. Therefore, when a cache hit occurs and the 
way address signal WA is input, the desired data is output at high speed through the cache 
output buffer 9b as cache data CD, and the cache memory data is obtained from the data selector 51 
in accordance with the data select signal DS, which is generated in response to the cache hit signal CH. 
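
The ordering described here, in which the SRAM read starts before the hit decision and the way is selected afterwards, can be illustrated with a small model. The array sizes, the function names, and the way the cache address is split into an SRAM row and column are assumptions made for the sketch; the text does not give them.

```python
def read_all_ways(sram, cache_row, cache_col):
    """Speculative SRAM read: one candidate bit per way, before CH and WA are known."""
    return [way[cache_row][cache_col] for way in sram]

def way_selector(candidates, wa):
    """Pick the candidate belonging to the hit way once WA arrives."""
    return candidates[wa]

def data_selector(ch, cd, dd):
    """Model of data selector 51: output cache data CD on a hit, DRAM data DD otherwise."""
    return cd if ch else dd

# sram[way][row][col]: 4 ways x 2 rows x 2 columns of single bits (toy sizes).
sram = [
    [[0, 0], [0, 1]],
    [[1, 0], [1, 1]],
    [[0, 1], [0, 0]],
    [[1, 1], [1, 0]],
]

# The read is issued first; the comparator result (ch, wa) is applied afterwards.
candidates = read_all_ways(sram, cache_row=1, cache_col=0)
cd = way_selector(candidates, wa=1)
out = data_selector(ch=True, cd=cd, dd=0)
print(candidates, cd, out)
```

Because the candidates for every way are already available when WA arrives, the only work left on a hit is the final selection, which is why the hit path is fast.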

Conversely, when the address signal input to the comparator 26 does not 
coincide with any address set held in the TAG 25, a cache miss occurs, and the cache hit 
signal CH is not generated by the comparator 26. Consequently, the cache data CD output 
from the SRAM memory cells is ignored. In this case, the state machine 27 performs RAS 
and CAS signal control for the ordinary read cycle, and the address multiplexer 22 supplies the 
row address signal RA and the column address signal CA sequentially to the DRAM elements 31 
(see Figure 4). Because output data is then obtained with the low-speed access time tRAC, 
a wait signal Wait is generated by the state machine 27 and the 
CPU 24 is brought into the standby state. 

In the case of a cache miss, the block data containing the memory cell accessed at that time is 
transferred to the I/O line pairs I/O1 ~ I/On of the internal I/O band 41 through the block transfer 
gates 110, which are made conductive by the block decoder 13. This data is then transferred to 
the SRAM memory cell array 12 through the way transfer gates 420 selected by the 
way address signal WA, and the content stored in the SRAM memory cells 120 in the row 
selected by the cache row decoder 43 is rewritten. In addition, the new address set accessed at this 
time is held in the TAG 25 for the way corresponding to this data block. 
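
The refill on a cache miss can be sketched as below. Several points are assumptions rather than statements of the specification: the SRAM row is taken to be indexed by the block number, the way to be overwritten is passed in by the caller because the text does not say how it is chosen, and the array sizes and the name `refill_on_miss` are purely illustrative.

```python
def refill_on_miss(dram, sram, tag, row, block, way, n):
    """Copy one block of a DRAM row into the chosen way and update the TAG.

    dram[row][col]      : DRAM array, one bit per cell
    sram[way][blk][col] : SRAM array, n columns per way (row index assumed = block)
    tag[block][way]     : row address cached for that block and way
    """
    start = block * n
    block_data = dram[row][start:start + n]  # block transfer gates -> I/O1 ~ I/On
    sram[way][block] = list(block_data)      # selected way transfer gates -> SRAM row
    tag[block][way] = row                    # new address set held in the TAG

# Toy sizes: 2 DRAM rows, 4 blocks of n = 2 columns, 4 ways.
n = 2
dram = [
    [0, 1, 1, 0, 1, 1, 0, 0],
    [1, 1, 0, 0, 0, 1, 1, 0],
]
sram = [[[0] * n for _ in range(4)] for _ in range(4)]
tag = [[None] * 4 for _ in range(4)]

refill_on_miss(dram, sram, tag, row=1, block=2, way=3, n=n)
print(sram[3][2], tag[2][3])
```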

As was explained above, the present embodiment makes it possible to increase the 
number of data entries in the TAG 25, because data corresponding to a plurality of blocks can be 
held in the SRAM memory cell array 12 as cache memory. As a result, the probability 
of a hit is increased, and the cache memory can also be accessed with a short 
access time. 

(Effect of the Invention) 

As was explained above, the present invention makes it possible to hold a great 
number of data blocks of the first memory cell array in the second memory cell array without 
unnecessarily increasing the block size, so that the number of data entries can be effectively 
increased. 

Moreover, because data blocks of different rows relating to one column of the first 
memory cell array are stored in the same row of the second memory cell array, any of the regions 
can be selected after the information has been read from each region of the second memory cell 
array. Therefore, since the second memory cell array can be used as a cache memory, access is 
enabled at an extremely high speed when there is a cache hit. Accordingly, the 
semiconductor memory device of this invention makes it possible to configure 
a simple set-associative cache system enabling cache hit operations at a high 
speed. 

4. Brief Explanation of Figures 

Figure 1 is a construction block diagram of a semiconductor memory device according 
to one embodiment of this invention, Figure 2 is a block diagram showing the details of the 
construction of one part of the semiconductor memory device shown in Figure 1, Figure 3 is a 
block diagram showing the construction of a simple set-associative cache system which utilizes 
the semiconductor memory device shown in Figure 1, Figure 4 is an operation waveform 
diagram explaining the operation of the simple cache system of Figure 3, Figure 5 is a block 
diagram showing the construction of a conventional DRAM element, Figure 6A is an operation 
waveform diagram of an ordinary read cycle of a DRAM element according to prior art, Figure 
6B is an operation waveform diagram of the page mode cycle of a DRAM element according to 
prior art, Figure 6C is an operation waveform diagram of the static column mode of a DRAM 
element according to prior art, Figure 7 is a block diagram showing the construction of a simple 
cache system utilizing the DRAM element shown in Figure 5, Figure 8 is an operation waveform 
diagram of the simple cache system of Figure 7, Figure 9 is a block diagram showing the 
construction of a DRAM element provided with a built-in cache memory, Figure 10 is a block 
diagram showing the detailed construction of one part of the DRAM element of Figure 9, Figure 
11 is a block diagram showing the construction of a simple cache system utilizing the DRAM 
element of Figure 9, and Figure 12 is an operation waveform diagram of the simple cache system 
of Figure 11. 

In these figures, 1 is a DRAM memory cell array, 2 is a word driver, 3 is a row decoder 
part, 4 is a sense amplifier part, 5 is an I/O switch part, 6 is a column decoder part, 7 is a row 
address buffer, 8 is a column address buffer, 9a, 9b are output buffers, 10a, 10b are input buffers, 
11 is a block transfer gate part, 12 is a SRAM memory cell array, 13 is a block decoder, 14 is a 
way decoder, 15 is a way address buffer, 41 is an internal I/O band, 42 is a way transfer gate part, 
43 is a cache row decoder, 44 is a cache I/O switch part, 45 is a cache column decoder part, 46 is 
a cache address buffer, 47 is a sense amplifier for SRAM, 48 is a way selector, BL, BL are a pair 
of bit lines of a DRAM memory cell array, and SBL, SBL are a pair of bit lines of a SRAM 
memory cell array. 

In addition, the same symbols indicate the same or corresponding parts in the figures. 